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DETAILED ACTION 
Claim Rejections - 35 USC § 103 

1 . The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 

obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 1 02 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

2. Claims 23-31, 33-45, 47-91 and 93-100 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over LADD (U.S. Patent 6,269,336). 

As to claim 23, LADD teaches a conversational browser (voice browser), 
comprising: means for interpreting a user command (voice input) and for generating a 
request (content request) to access a CML file (markup language document), wherein 
CML comprises meta-information implementing a conversational dialog for interaction 
with the user in a plurality of user interface modalities (col. 3, lines 40-46; col. 4, lines 
36-43; col. 4, lines 52-58); and a CML processor (parsing unit) for parsing and 
interpreting a CML file to render the conversational dialog in one or more of the plurality 
of user interface modalities (col. 11, lines 25-49; col. 11, line 66 - col. 12, line 24; col. 3, 
lines 40-46; col. 4, lines 36-43; col. 4, lines 52-58). It would be obvious to one skilled in 
the art that browser means for interpreting is a voice interface for receiving the voice 
commands. 

As to claims 24 and 25, LADD teaches a conversational browser (voice browser) 
of a computing device that provides a conversational user interface to render a 
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conversational dialog (col. 11, lines 25-49). LADD also teaches that variations and 
modifications may be practiced on the system (col. 2, lines 10-14). However, LADD 
does not teach that the browser executes on top of an operating platform. Official 
Notice is taken in that it is well known in the art that a browser executes on a virtual 
machine to send and handle remote request and therefore would be obvious in view of 
LADD in order to send and handle voice requests. 

As to claims 26-29, LADD teaches a dialog manager (VRU server / interpreter 
unit) for managing and controlling the conversational dialog wherein the dialog manager 
allocates conversational engines (test to speech unit / automatic speech recognition 
unit) for rendering the conversational dialog by meta-information of a CML file (col. 9, 
lines 1-53; col. 13, lines 41-60). 

As to claims 30, 31 , 33 and 34, LADD teaches the user input command (voice 
input) can be input in the one or more user interface modalities (col. 1 1 , lines 31-35; col. 
3, lines 40-46; col. 4, lines 36-43; col. 4, lines 52-58; col. 2, lines 48-66), the CML is 
implemented in a declarative format encapsulating multi-modal dialog (col. 16, lines 5- 
56). Official Notice is taken in that it is well known in the art that XML is a markup 
language and therefore would be obvious that the markup language of LADD is XML. 

As to claims 35-38, LADD teaches the input commands to the browser are voice 
commands (col. 1 1 , lines 26-36). Therefore, it would be obvious to one skilled in the art 
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that the since the commands are voice commands that navigates to a web page that the 
browser implements a "what you hear is what you can say", a "say what you heard", a 
"say what you will hear", and a "mixed initiative dialog formats. 

As to claim 80, LADD teaches a method for accessing information, comprising 
the steps of: processing an input command (voice input) with at least one of a plurality 
of conversational engines (network fetcher); generating a request (content request) 
based on the processed input command (voice input) to access a CML file (markup 
language document) from a content server (mark up language server), the CML file 
comprising meta-information to implement a conversational dialog in a plurality of user 
interface modalities (col. 3, lines 40-46; col. 4, lines 36-43; col. 4, lines 52-58); 
transmitting the request (content request) and accessing the requested CML file from a 
content server using a standard networking protocol; and processing the meta- 
information comprising the CML file to render the conversational dialog in one or more 
of the plurality of user interface modalities (via parsing the information and executing the 
file using the browser to display and/or play sound) (col. 1 1 , lines 25-49; col. 1 1 , lines 
66 - col. 12, line 25; col. 14, lines 3-17; col. 2, lines 20-39; col. 2, line 59 - col. 3, line 
5). 

As to claims 81 and 82, LADD teaches a conversational browser (voice browser) 
of a computing device executes the steps (col. 1 1 , lines 25-49). LADD also teaches 
that variations and modifications may be practiced on the system (col. 2, lines 10-14). 
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However, LADD does not teach that the browser executes on top of an operating 
platform. Official Notice is taken in that it is well known in the art that a browser 
executes on a virtual machine to send and handle remote request and therefore would 
be obvious in view of LADD in order to send and handle voice requests. 

As to claims 84 and 85, LADD teaches customizing the CML file (markup 
language document) based on the conversational capabilities of the browser (the 
structure of the language can be designed specifically for voice applications); and 
registering the capabilities with the content server (via storing the files on markup 
language servers) (col. 15, line 60 - col. 16, line 21). 

As to claim 83, LADD teaches the steps are distributed using a conversational 
engine (test to speech unit / automatic speech recognition unit) and conversational 
arguments (request data / document attributes) (col. 1 1 , lines 25-49; col. 9, lines 1-53; 
col. 13, lines 41 -60). 

As to claim 86-88, LADD teaches transcoding legacy content of the content 
server (information from the information sources) into CML based on predefined 
transcoding rules (via the parser unit) (col. 12, lines 15-24; col. 5, lines 8-11). 

As to claim 89, LADD teaches processing the meta-information comprises 
playing back an audio file or generating synthesized speech output (col. 4, lines 50-61 ). 
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As to claims 90, 91 and 93, LADD teaches the CML is implemented in a 
declarative format encapsulating multi-modal dialog (col. 16, lines 5-56). Official Notice 
is taken in that it is well known in the art that XML is a markup language and therefore 
would be obvious that the markup language of LADD is XML. 

As to claims 94-100, LADD teaches the CML (via markup language document) 
comprises one of (1 ) a top level element that groups other CML elements; (2) an 
element that specifies output to be spoken to the user (3) a menu element for 
encapsulating a menu that presents the user with a list of choices wherein each choice 
is associated with a target address identifying a CML element to visit if the 
corresponding choice is selected; (4) a form element for encapsulating a form that 
allows the user to input at least one item of information and transmit the at least one 
item of information to a target address; and (5) a combination thereof (col. 16, lines 29 - 
col. 17, line 49). 

As to claim 39, LADD teaches a system for accessing information (information), 
comprising: a content server (mark up language server) comprising content pages 
(mark up language documents), wherein the content pages are implemented using a 
CML (mark up language) to describe a conversational dialog for interaction with a user 
in a plurality of user interface modalities (view and audio) (col. 15, line 60 - col. 16, line 
57; col. 3, lines 40-46; col. 4, lines 36-43; col. 4, lines 52-58); and a conversational 
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browser (voice browser) for processing a CML page received from the content server to 
render its conversational dialog in one or more of the plurality of user interface 
modalities (col. 11, lines 25-49; col. 11, line 66 -col. 12, line 24; col. 3, lines 40-46; col. 
4, lines 36-43; col. 4, lines 52-58). However, LADD does not teach that the browser 
executes on top of an operating platform. Official Notice is taken in that it is well known 
in the art that a browser executes on a virtual machine to send and handle remote 
request and therefore would be obvious in view of LADD in order to send and handle 
voice requests. 

As to claims 40-44, LADD teaches the system comprises an IVR system 
implemented in CML (system capable of handling a voice markup language document) 
(col. 11, lines 25-49; col. 14, lines 3-17) and accessibly over a packet-switched network 
using a standard network protocol (col. 2, lines 26-39). 

As to claims 45 and 47-51 , LADD teaches the CML is implemented in a 
declarative format encapsulating multi-modal and speech dialog (col. 16, lines 5-56; col. 
16, line 58- col. 17, line 49). Official Notice is taken in that it is well known in the art 
that XML is a markup language and therefore would be obvious that the markup 
language of LADD is XML. 

As to claims 52-54, LADD teaches a conversational browser (voice browser) on a 
computing device communicating over a communications network (col. 11, lines 25-49). 
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LADD also teaches that variations and modifications may be practiced on the system 
(col. 2, lines 10-14). However, LADD does not teach that the browser executes on top 
of an virtual machine. Official Notice is taken in that it is well known in the art that a 
browser executes on a virtual machine to send and handle remote request and 
therefore would be obvious in view of LADD in order to send and handle voice requests. 



As to claims 55 and 56, LADD teaches standard network protocols are utilized for 
accessing CML content pages from the content server (col. 5, lines 37-62; col. 2, lines 
26-39). 

As to claims 57-62, LADD teaches transcoding legacy content of the content 
server (information from the information sources) into CML based on predefined 
transcoding rules (via the parser unit) (col. 12, lines 15-24; col. 5, lines 8-11). 

As to claims 63-71, LADD teaches CML (via markup language document) 
comprises a plurality of capability-based frames, an active link, a link to conversational 
data files, a link to at least one distributed conversational engine, a link to an audio file 
for playback, a confirmation message tag, TTS markup, scripting language and 
imperative code, and a link to one of a plug-in or an applet for executing a 
conversational task (col. 16, line 29 - col. 17, line 49). 
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As to claims 72-79, LADD teaches the CML (via markup language document) 
comprises one of (1) a top level element that groups other CML elements; (2) an 
element that specifies output to be spoken to the user (3) a menu element for 
encapsulating a menu that presents the user with a list of choices wherein each choice 
is associated with a target address identifying a CML element to visit if the 
corresponding choice is selected; (4) a form element for encapsulating a form that 
allows the user to input at least one item of information and transmit the at least one 
item of information to a target address; and (5) a combination thereof (col. 16, lines 29 - 
col. 17, line 49). 

Response to Arguments 

3. Applicant's arguments filed 5/23/05 have been fully considered but they are not 
persuasive. As to claims 23-31, 33-45, 47-91, and 93-100, Applicant argues that Ladd 
does not disclose or suggest conversational browsers or system for processing CML 
documents which comprise meta -information to enable interaction with the user in a 
plurality of user interface modalities to render the dialog in one or more user interface 
modalities. Applicant states that Ladd merely discloses a system in which a voice 
browser is capable of processing speech markup file and rendering a speech/audio 
interface only. The examiner disagrees. Ladd states that the network access 
apparatus of the system allows the user to access (i.e., view and/or hear) the 
information retrieved from the information source wherein the information is in the form 
of machine readable data, human readable data, audio or speech communications, 
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textual information, graphical or image data, etc (col. 3, lines 40-46). The output can 
include a speech communication, textual information, and/or graphical information (col. 
4, lines 50-58). Because the output information is both spoken and displayed there 
exist a plurality of user interface modalities, i.e. visual and audio. Therefore, Ladd 
teaches the browser system for processing CML documents having meta information 
(requested information) to enable interaction with the user in a plurality of user interface 
modalities as disclosed in the claims. Therefore, the rejection is maintained as detailed 
above. 



Conclusion 

4. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1 .1 36(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1 .136(a) will be calculated from the mailing date of 
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the mailing date of this final action. 
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Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Lewis A. Bullock, Jr. whose telephone number is (571) 
272-3759. The examiner can normally be reached on Monday-Friday, 8:30 - 5:00. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Meng An can be reached on (571 ) 272-3756. The fax phone number for the 
organization where this application or proceeding is assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). 
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PRIMARY EXAMMER 
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